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Abstract. We discuss a number of properties of CMV matrices, by which we 
mean the class of unitary matrices recently introduced by Cantero, Moral, and 
Velazquez. We argue that they play an equivalent role among unitary matrices 
to that of Jacobi matrices among all Hermitian matrices. In particular, we 
describe the analogues of well-known properties of Jacobi matrices: foliation by 
co-adjoint orbits, a natural symplectic structure, algorithmic reduction to this 
shape, Lax representation for an integrable lattice system (Ablowitz-Ladik), 
and the relation to orthogonal polynomials. 

As offshoots of our analysis, we will construct action/angle variables for the 
finite Ablowitz-Ladik hierarchy and describe the long-time behaviour of this 
system. 



1. Introduction 

For many reasons, it is natural to regard Jacobi matrices as occupying a certain 
privileged place among all Hermitian matrices. By 'Jacobi matrix' we mean a 
tri-diagonal matrix 



a i 



a i 
b-2 



a n-l 



(1) 



with cij > 0, bj 6 R. The main purpose of this paper is explain why we consider 
a family of unitary matrices introduced recently by Cantero, Moral, and Velazquez 
(see [H]) as playing the corresponding role among unitary matrices. 

Definition 1.1. Given coefficients ao, . . . , a n _2 in D and a„_i £ S 1 , let pk = 

yl — \ak\ 2 , and define 2x2 matrices 



Pk 



Ph 



are lxl matrices. From 



for < k < n — 2, while S_i = [1] and ^ n -i — [a n -i] 
these, form the n x n block-diagonal matrices 

C = diag(So,S2,S4, . . .) and M = diag(S_i, Hi, H3, . . .). 

The CMV matrix associated to the coefficients a , ■ ■ ■ , a„_i is C = CM. 
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Following |22], we will refer to the numbers a,k as Verblunsky coefficients . A 
related system of matrices was discovered independently by Tao and Thiele, [3"T] . 
in connection with the non-linear Fourier transform. Their matrices are bi-infinitc 
and correspond to setting odd-indexed Verblunsky coefficients to zero. (This has 
the effect of doubling the spectrum; cf. |221 P- 84].) 

Expanding out the matrix product CM. is rather labourious. In Figure ^ we 
show the result for n = 8. As can be seen from this example, CMV matrices have 
a rather rigid structure: 

Definition 1.2. We say that an n x n matrix has CMV shape if the entries have 
the following pattern of horizontal 2x4 blocks: 











+ 

* 

* * 



or 



+ 
* 
* 
* 

















* + 



(2) 



where + represents a positive entry and * represents a possibly non-zero entry. The 
top left corner of a CMV matrix always has the 2x3 structure depicted above, 
whereas the bottom right corner will consist of a 2 x 3 block if n is even (left matrix 
above) or a 1 x 2 block (right matrix above) if n is odd. 

Naturally, CMV matrices have CMV shape; a proof of this can be found in the 
original paper of Cantero, Moral, and Velazquez, 6 . Conversely, |Z] shows that 
a unitary matrix with CMV shape must be a CMV matrix. (Alternate proofs of 
these results can be found at the end of Section |3]) This will be very useful for us 
since shape and unitarity properties are easier to check than comparing all matrix 
entries. To facilitate describing the shape of matrices we will use the following 
terms: 

Definition 1.3. The upper staircase of a matrix consists of the first non-zero entry 
in every column (or last in every row). Conversely, the lower staircase consists of 
the first non-zero entries in the rows. 

The entries marked + are particularly important so we give them a name too. 

Definition 1.4. The entries marked + are precisely (2, 1) and those of the form 
(2j — 1, 2j + 1) and (2j + 2, 2j) with j > 1. We will refer to these as the exposed 
entries of the CMV matrix. 

We will now give a brief overview of the contents of the paper. We will structure 
this presentation around the order of the sections. 

In Section 2, we describe the origin of CMV matrices in the theory of orthog- 
onal polynomials on the unit circle. Nothing said there is new; it is provided for 
completeness. 

In Section 3, we show how a general unitary matrix can be reduced to a (direct 
sum) of CMV matrices with a simple efficient algorithm. Our model here is the fa- 
mous Householder implementation of the Lanczos reduction of a general symmetric 
matrix to tri-diagonal shape by orthogonal conjugation [201 §6.4]. This significantly 
reduces the storage requirements during eigenvalue computations. 
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Figure 1 . An 8x8 CMV matrix in terms of the Verblunsky Coefficients 

In Section 4, we describe a symplectic structure on the set of CMV matrices of 
fixed determinant. Note that in the definition of CMV matrices, det(Sfe) = — I for 
< k < n — 2 and hence 

detC = (-I)"- 1 ^.!. 

The specific symplectic structure is defined by algebraic means. This work is in- 
spired by corresponding work on Jacobi matrices stimulated by their appearance 
in the solution of the Toda lattice, |14| . Kostant, |22| . noticed that by altering the 
natural Lie bracket on real n x n matrices, the set of Jacobi matrices with fixed 
trace becomes a co-adjoint orbit. As a consequence, this manifold is a symplectic 
leaf of the Lie-Poisson bracket. Moreover, he uncovered an algebraic interpretation 
of the complete integrability of the Toda lattice, which he extended to other finite 
dimensional Lie algebras. 

As noted in Section 4, the natural algebraic setting for CMV matrices is a Lie 
group rather than a Lie algebra. We show that the set of CMV matrices with fixed 
determinant forms a symplectic leaf for a certain Poisson structure on GL(n; C). We 
will refer to this as the Gelfand-Dikij bracket, though the name Sklyanin bracket 
would be equally appropriate. 

In a concurrent paper, [23| , Luen-Chau Li has independently derived the main 
results of this section. In a sense, his approach is the reverse of ours: he studies the 
action of certain dressing transformations, while we arrive at their existence only 
after studying the problem by other means. 

Section 4 also contains a few simple remarks regarding the Ablowitz-Ladik 
hierarchy. The original (defocusing) Ablowitz Ladik equation ^ |2] is a space- 
discretization of the cubic nonlinear Schrodinger equation: 

-i$ k = pUPk+i + At-i) - 2/3 fe , (3) 

where (3 n is a sequence of numbers in the unit disk indexed over Z and pk = 
(1 — l/Jfel 2 ) 1 / 2 . If we change variables to ctk(t) = e 2lt (3k(t), this system becomes 

-iafc = p\{ak+i + Qffc-i), (4) 

which is a little simpler. If we then choose a_i and o„_i to lie on the unit circle, 
then they do not move and we obtain a finite system of ODEs for a^, < fc < n — 2, 
which is the specific case we treat. 

These equations form a completely integrable Hamiltonian system; indeed they 
were introduced with this very property in mind. By considering all possible func- 
tions of the commuting Hamiltonians, one is immediately lead to a hierarchy of 
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equations containing (@J) as a special case. Similar considerations lead from the 
original Toda equation to the full Toda hierarchy. 

As presented in Section 4, the matters discussed under the rubric 'Ablowitz- 
Ladik hierarchy' are not obviously linked to the equations just discussed. We make 
this connection in Section 5. 

As a Hamiltonian system, the Ablowitz-Ladik equation comes with a symplectic 
form. In Section 5, we prove that this agrees with the bracket we introduced 
from algebraic considerations. Viewed from the opposite perspective, the goal of 
this section is to write the Gelfand-Dikij bracket tensorially using the Verblunsky 
coefficients as a system of coordinates. 

In Sections El and we study further properties of the Ablowitz-Ladik system 
following the work of Moser, [21] , on the Toda lattice. Specifically, we study how the 
spectral measure naturally associated to a CMV matrix evolves under the Hamil- 
tonians of the Ablowitz-Ladik hierarchy. This information is then used to derive 
long-time asymtotics and determine the scattering map. 

The last topic we treat is the construction of action/angle coordinates for the 
finite Ablowitz-Ladik system. This is done in Section 8. 

Acknowledgements: We are delighted to dedicate this paper to Percy Deift on the 
occasion of his sixtieth birthday. Our understanding of the matters discussed here 
owes much to his lectures, We are also grateful to him for encouragement 

along the way. 

2. Orthogonal Polynomials. 

As CMV matrices arose in the study of orthogonal polynomials, it is natural 
that we begin there. We will first describe the relation of orthogonal polynomials 
to Jacobi matrices and then explain the connection to CMV matrices. 

Given a probability measure dv supported on a finite subset of R, say of car- 
dinality n, we can apply the Gram-Schmidt procedure to {1 1 } and 
so obtain an orthonormal basis for L 2 (dv) consisting of polynomials, {pj(x) : j — 
0, . . . , n — I}, with positive leading coefficient. In this basis, the linear transforma- 
tion f(x) <— * xf{x) is represented by a Jacobi matrix. An equivalent statement is 
that the orthonormal polynomials obey a three-term recurrence: 

xpj(x) = a 3 p j+1 (x) + bjPj{x) + dj-iPj-i(x) 

where a_x = and p n = 0. A third equivalent statement is the following: A is an 
eigenvalue of J if and only if A £ supp(cZ^); moreover, the corresponding eigenvector 
is [p (A),pi(A), . . . ,p„_i(A)] T . 

We have just shown how measures on R lead to Jacobi matrices; in fact, there 
is a one-to-one correspondence between them. Given a Jacobi matrix, J, let dv be 
the spectral measure associated to J and the vector e\ = [1, 0, . . . , 0] T . Then J 
represents x \— > xf(x) in the basis of orthonormal polynomials associated to dv. 

Before explaining the origin of CMV matrices, it is necessary to delve a little 
into the theory of orthogonal polynomials on the unit circle. For a more complete 
description of what follows, the reader should turn to [25] . 

Given a finitely-supported probability measure d\i on S , the unit circle in C, we 
can construct an orthonormal system of polynomials, (f>k, by applying the Gram- 
Schmidt procedure to {1,2,...}. These obey a recurrence relation; however, to 



CMV: THE UNITARY ANALOGUE OF JACOBI MATRICES 



5 



simplify the formulae, we will present the relation for the monic orthogonal poly- 
nomials < &fc(z): 

$ fe+ i(z) = z$ fc (z) - a k <P* k (z). (5) 

Here a k are recurrence coefficients, which are called Verblunsky coefficients, and 
denotes the reversed polynomial: 

fe k 
1=0 1=0 

When dfx is supported at exactly n points, cn £D for < k < n — 2 while a n -\ is 
a unimodular complex number. (Incidentally, if dfi has infinite support, then there 
are infinitely many Verblunsky coefficients and all lie inside the unit disk.) 

To recover the relation between the orthonormal polynomials, one need only 
apply the following relation, which can be deduced from : 

k-l 

ii^iL^dM) = n p i > where p i = V 1 - \ a i?- (7) 

(=0 

The Verblunsky coefficients completely describe the measure dfj,: 

Theorem 2.1 (Verblunsky). There is a 1-to-l correspondence between probabil- 
ity measures on the unit circle supported at n points and Verblunsky coefficients 
(ao, . . • , &n-i) with a k £ D for < k < n — 2 and a n _i G S . 

From the discussion of Jacobi matrices, it would be natural to consider the matrix 
representation of f(z) i— > zf(z) in L 2 (dfi) with respect to the basis of orthonormal 
polynomials. This is not a CMV matrix; rather it is what Simon, |29) . has dubbed 
a GGT matrix, from the initials of Geronimus, Gragg, and Teplyaev. Perhaps the 
most striking difference from a CMV (or Jacobi) matrix is that a GGT matrix is 
very far from sparse — generically, all entries above and including the sub-diagonal 
are non-zero. (In a sense, CMV matrices are optimally sparse; see Theorem 13.61 ) 

Cantero, Moral, and Velazquez had the simple and ingenious idea of applying 
the Gram-Schmidt procedure to {1, z, z , z 2 , z~ 2 , . . .} rather than {1, z, . . .}. The 
resulting functions, Xk{z) (0 < k < n — 1), are easily expressed in terms of the 
orthonormal polynomials: 

\ z' k / 2 4>* k (z) : k even 

Xk{z) - \*-<*-WV*(*) :fcodd. (8) 

In this basis, the map f(z) <— > zf(z) is represented in an especially simple form: 

Theorem 2.2. In the orthonormal basis {xfe( z )} °f L 2 (d[i), the operator f{z) i— > 
zf(z) is represented by the CMV matrix associated to the Verblunsky coefficients of 
the measure dfi. 

The C = LM. factorization presented in the introduction originates as follows: 
Let us write Xk , < k < n — 1, for the orthonormal basis constructed by applying 
the Gram-Schmidt procedure to {1, z , z, z~ 2 , z 2 , . . .}. Then the matrix elements 
of £ and Ai are given by 

£j+i,fc+i = (Xj{z)\zx k (z)), M j+ i,k+i = (xj(z)\xk(z))- 
See for further discussion. 
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The measure dfj, can be reconstructed from C in a manner analogous to the Jacobi 
case: 

Theorem 2.3. Let dfi be the spectral measure associated to a CMV matrix, C, and 
the vector e\. Then C is the CMV matrix associated to the measure dfi. 

Proofs of these two Theorems can be found in [H] or ■ The second result also 
follows from Corollary 13 .41 below. 

3. Reduction to CMV shape. 

The purpose of this section is to explain a simple analytic (and numeric) al- 
gorithm for the reduction of a unitary matrix to CMV form. At the core of the 
algorithm is the Householder method for reducing the number of nonzero entries of 
a matrix by conjugating it with reflections. These are chosen in such a way as to 
allow for successive applications of the reduction. 

The Householder algorithm is the method of choice to reduce any real symmetric 
matrix to tridiagonal form or any non-symmetric matrix to Hessenberg shape. Here 
we implement the successive reductions differently, by alternating columns and rows 
(see Figure|5J) to reduce any unitary matrix for which e\ = [1,0,..., 0} T is cyclic to 
CMV shape. With the obvious modification, it reduces other unitary matrices to 
a direct sum of CMV matrices. 
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Figure 2. The CMV-ification algorithm in action. 



Let us briefly describe the construction of Householder reflections; what we 
present is a very slight modification of what one will find in most textbooks. Given 
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m£C" and < m < n, let 



Um+1 



( t Kf) 



1/2 



V = 



[0, . . . ,0, a,u m+2 , . ■ ■ ,u n ] with a = u m+1 



The reflection through the plane perpendicular to v is given by 



vv 



,t 



R = I - 2 




Naturally, RJ = R 1 = R; moreover, 



Ru = u — v = kti 



, . . . , M m , u m +i - a, . . . , 0] 



and since the first m entries of v are identically zero, 7? has the block structure 



where I m is the mxm identity matrix, and * denotes an unspecified (n—m) x (n—m) 
matrix. 

When working over K, one typically chooses a slightly differently to guarantee 
that u m +i — a is positive (or zero). This is not possible in our case. To remedy the 
situation, we multiply by a suitably chosen rotation D — diag(f , . . . , f , e l6 , 1, . . . , 1) 
where e 10 sits in the (m + l)th position. 

Replacing R by DR gives us the desired modification of the standard Householder 
reflection. We summarize its properties as follows: 

Lemma 3.1. Given u 6 C™ and < m < n there is a Householder reflection R 
that has the block structure © and 



We will refer to R as the reflection at level m for u. 

Theorem 3.2 (A Householder algorithm for unitary matrices). Any n x n unitary 
matrix, U , for which e\ is cyclic can be reduced to CMV shape by successive conju- 
gations with Householder reflections. Moreover, this can be done without altering 
the spectral measure associated to e\. 

Proof. During the proof, we will refer the reader to Figure |21 The small numbers 
above each arrow enumerate the steps in the algorithm. 

As we will see, at every step U is conjugated by a matrix that leaves e\ invariant. 
Thus the spectral measure associated to e\ will not change. In particular, this vector 
will remain cyclic throughout. 

For the first step, we choose R to be the reflector at level two for the first 
column of U and then form RUR^ . Left multiplication by R creates zeros in the 
places indicated and these are not destroyed by right multiplication by R) because 
of its block structure. One detail remains: why the entry marked + is not zero. The 
answer is simple. This entry would only vanish if the first column of the original 
matrix where a multiple of e\. This would make e± an eigenvector for U and so not 
cyclic. 

Next we describe Step 3 as an archetype for all subsequent odd-numbered steps. 
Even-numbered steps differ from odd-numbered steps only by interchanging the 
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roles of rows and columns. We will write U for the matrix output by step two (not 
the original one). 

Let R be the reflector at level 3 associated to the second column of U, then the 
effect of Step 3 is U i-> RUR) '. For the reasons explained in Step 1, this will produce 
zeros in the second column as shown in Figures [5] and [3] This state of knowledge is 
represented by the centre matrix in Figure Two questions remain however: why 
is the entry newly marked + not zero, and why do additional zeros appear in the 
fourth column. We answer them in reverse order. 
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Figure 3. Matrices in the discussion of Step 3. 



The rows of a unitary matrix must be orthogonal; in particular, all rows must 
be perpendicular to the first. As the third entry in the top row is non-zero (indeed 
positive), it follows that the third column must have zeros in the places indicated 
(cf. the third matrix in Figure [3J. 

If the entry marked + in the second column were actually zero, then by the 
reasoning of the previous paragraph, its right-hand neighbour would also vanish. 
This contradicts the cyclicity of e%; specifically, the linear span of {ei, e%, e%} would 
be an invariant subspace. 

With obvious modifications, these arguments apply to any odd step (> 3). As 
noted even steps are analogous but with rows and columns reversed; indeed one 
can just apply the discussion above to U*. (As U is unitary, e\ is also cyclic for 
C/t.) 

Note that in the last step, no additional zeros are produced. This does not result 
in any modifications to the argument, we merely wish to point out that in this case, 
E is a diagonal matrix. □ 

Theorem l3.2l tells us how to reduce any unitary matrix having e\ as a cyclic vector 
to CMV shape. Now we wish to show that any unitary matrix U in CMV shape is 
actually a CMV matrix, i.e., there exists a set of coefficients ao, . . . , a n -2 £ © and 
a n -i S S 1 so that U is the matrix given by Q). This result is not new, and can 
be found in [7]. We prove this result differently, and obtain it as a corollary of the 
following 

Proposition 3.3. Let B and C be two unitary matrices in CMV shape that have 
the same spectral measure with respect to the vector e\. Then B = C . 

Proof. Since B and C have the same spectral measure with respect to e%, there 
exists a unitary matrix V so that Ve\ = e\ and 

C = V^BV. (10) 

We inductively prove that Vet = e^ for all 1 < k < n, which shows that V is the 
identity and so B = C. 
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Assume that Vek = &k holds for all k < m. (From the definition of V, it is true 
when m — 1.) This implies that the top left m x m blocks of B and C coincide: 

(e j \Ce k ) = {Ve j \B{Ve h )) = {e j \Be k ) 

for all 1 < j, k < m. We now proceed differently depending on m. 

If m > 3 is odd, consider the (m — l) th columns of C and B. As both matrices 
are of CMV shape and agree on the principle m x m minor, the only place that 
these columns may differ is in the (m + l) th position. In fact, they must agree here 
too because CMV shape requires that this entry be positive and then it is uniquely 
determined by the fact that the columns of a unitary matrix have norm one. 

From VC = BV, Ve m -x — e m -i, and Ce m -i = Be m -i, which we just proved, 
we have 

(V - l)Ce m _! = (BV - C)e m _ a = 0. (11) 
But now by the shape of the (m — l) th column of C described above, we know that 

m 

CCm— 1 — Cm+l,m— lEm+1 ~\~ ^ Cfc,m— l^fc 

k— m — 2 

with C rn +x,m-x > 0. Substituting this into I|11J) gives (1 — V)e m +x = 0, which 
concludes the inductive step in this case. 

If m = 1, one should repeat the above analysis using the first column. 

If m is even, then one must repeat the above argument using rows instead of 
columns. Alternatively, one may proceed as above after first taking the (conjugate) 
transpose in 110(1 ■ □ 

Corollary 3.4. Let U be an n x n unitary matrix in CMV shape. Then there is a 
unique (n — 1) -tuple of Verblunsky coefficients ao, . . . , ot n -2 £ © and a„_i £ S 1 so 
that U is the CMV matrix associated to these coefficients. 

Proof. For a unitary matrix in CMV shape, let \x be the spectral measure associated 
to e\, and ao, . . . , a n -i the corresponding Verblunsky coefficients. The CMV ma- 
trix C given by Definition ll.ll has p, as its spectral measure. So, by Proposition ^. 31 
the original matrix must equal C, which is indeed a CMV matrix. Uniqueness of 
the coefficients is a direct consequence of Proposition 13 . 31 and Verblunsky 's Theo- 
rem O □ 

Remark 3.5. There is another way of proving the corollary, which involves recur- 
sively identifying the a's in a unitary matrix of CMV shape. This is essentially the 
proof of Theorem 3.8 from [J]- 

Further note that one can read the Verblunsky coefficients from a CMV matrix. 
In other words, for any unitary matrix C in CMV shape there exists a unique set 
of Verblunsky coefficients so that C is the CMV matrix with these coefficients. 

We close this section by presenting a result of Cantero, Moral, and Velazquez 
(see Theorem 3.9 of [7]) which implies that, in terms of sparseness, CMV matrices 
are minimal among unitary matrices having cyclic vectors. 

We call a unitary matrix, U, (p, q)-diagonal if it has only p nonzero subdiagonals 
and q nonzero superdiagonals. In other words, we require that Uj ik = for j — k > 
p + 1 and k — j > q + 1. Note that CMV matrices are (2, 2)-diagonal. With this 
definition, the statement is the following: 
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Theorem 3.6. A unitary (p, l)-diagonal matrix is a sum of diagonal blocks of order 
no greater than p + 1. The same is true of any unitary diagonal matrix. 

In particular, note that any 4-diagonal unitary matrix is a direct sum of 3 x 3 
blocks. As breaking a matrix into a direct sum is tantamount to factoring the 
characteristic polynomial, no exact algorithm can perform this task for generic 
input. Similarly, it is impossible to reduce a unitary matrix to four diagonals 
whilst preserving the spectral measure associated to e±. 

4. Lie structure. 

The manifold of Jacobi matrices with fixed trace forms a co-adjoint orbit associ- 
ated to a particular Lie algebra structure on the n x n matrices (although not the 
one induced by GL(n;M.)). This gives the manifold a symplectic structure. These 
matters are described in detail in ^1[57||2H], for example. 

In this section, we will show that the manifold of CMV matrices with fixed 
determinant is a symplectic leaf for a natural Poisson structure on GL{n, C) and 
(what is basically equivalent) it is exactly the orbit of a natural group action on this 
space, and so also have a natural symplectic structure. For pedagogical reasons, 
we will present the symplectic structure first and then describe how it relates to a 
group action. (As described in the Introduction, |23| . takes the opposite approach 
to arrive at the same conclusions.) 

In deference to those new to the subject, we will endeavour to use the notation 
and terminology of [27] • We will also mostly refer to this book for proofs that 
we omit; we hope that the authors of the original articles will forgive this indirect 
reference to their work. 

Self-adjoint matrices form a vector space, hence it is natural that the orbit and 
symplectic structures on Jacobi matrices arises from the study of a Lie algebra 
(actually, its vector-space dual). Unitary matrices form a group and hence the 
natural backdrop for CMV is a Lie group. Specifically, the construction we give 
below is that of a Sklyanin bracket on Gl(n,C); see |57| §2.12]. However, we 
choose to give a presentation in which the algebra of matrices takes centre stage; an 
analogous construction for KdV using the algebra of pseudo-differential operators 
was given by Gelfand and Dickij ^7J. This approach is described in Section 2.12.6 of 
[77] . (Note that here we are referring to the second symplectic structure associated 
with KdV, which was originally proposed by Adler §4].) 

Let q denote the (associative) algebra of n x n complex matrices. The algebra 
structure gives rise to a natural Lie algebra structure: 

[B, C]=BC- CB. 

Of course, this also results from viewing g as the Lie algebra of GL(n; C). 
As a vector space, g = I © o, where 

a = {A: A = -A% 

which is the Lie algebra of the group U(n) of n x n unitary matrices, and 

[={ieg: Lij = for i > j and L iti G M} 

which is the Lie algebra of the group L(n) of n x n lower triangular matrices with 
positive diagonal entries. We will write ir a and ~k\ for the projections into these 
summands. 
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This vector-space splitting of g permits us to give it a second Lie algebra struc- 
ture. First we define R : g — > g by either 

R(X) = m(X) - n a (X), forallXe , or 

R(L + A) = L - A for all A G a and L G I ' ' " ' 

The second Lie bracket can then be written as either 

[X,Y] R = \[R(X),Y] + \[X,R(Y)] VX,Yeg,or 

[L + A,L' + A'] R = [L, L'] — [A, A'] V L, L' G I, and A, A' G o. '"" 

The second definition also makes it transparent that the i?-bracket obeys the Jacobi 
identity. 

Whether one treats Jacobi or CMV matrices, the (vector space) dual g* of g 
plays an important role. We can identify it with g using the pairing 

(X,Y) =Imtr(Xy). (14) 

This is not an inner product; however, it is non-degenerate — just choose Y = iX'. 
It is also symmetric and ad-invariant: 

{X, [Z, Y]) = lmti(XZY - XYZ) = ([X, Z],Y). (15) 

This is equivalent to Ad-invariance: 

{BXB^.BYB- 1 ) = (X, Y), for any B G GL(n, C). (16) 

Concomitant with the pairing is an identification of 1-forms and vector fields 
on g: 

Definition 4.1. Given (j> : g — > R and B G g, define V</> : g — > g by 

d_ 

It 

Equivalently, if we write bk,i — Uk,i + iVk,i for the matrix entries of B, then 

m k , = ^+ip- as) 

ov Uk du Uk 
(notice the reversal of the order of the indices) . 

We can now define the desired Poisson bracket on g. In fact, by first checking 
some elementary properties of the objects set out above, one can show that it obeys 
the Jacobi identity. 

Proposition 4.2. Let R : g — > g and {■ , •) be as above. 

(i) o ±= o and I 1 - = I. 

(ii) R is antisymmetric: (X,R(Y)) = -(R(X),Y). 

(iii) R obeys the modified classical Yang-Baxter equation: 

[R(X),R(Y)} - R([R(X),Y] + [X,R(Y)}) - -[X,Y] 

(iv) Given <j), ip : g -> R, let X = and Y = Vip. Then 

{^}\ B = \{R{XB),YB) \{R{BX),BY) (19) 
= \{R(XB),YB) + \{BX,R(BY)). (20) 
defines a Poisson structure on g. 



<P(B + tC) = (V<f>\ B ,C). (17) 

t=o 
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Proof, (i) For A,B e a, 

Imtr(AB) = ± tr(AB - B^A r ) = ± tr(AB - BA) = 0. 

For L, V <E I, the diagonal entries of LL 1 are products of the corresponding entries 
in L and V . In particular, they are real; therefore, Imtr(LL') = 0. 

We have just shown that a" 1 C a and [ C [. As the pairing is non-degenerate, 
equality follows by dimension counting: dim(o) = dim([) = n 2 — |dim(g). 

(ii) Antisymmetry follows from part (i) by simple computation: 

(R{L + A),L' + A') = (L, L') - (A, A') = (L + A, R{L' + A')). 

(iii) This can be checked by direct computation. However, this belies the impor- 
tant role of the mCYB equation in the theory of double Lie algebras and thence in 
the theory of integrable systems. For more information, see |27l §2.2] or |12l I28j . 

(iv) This is an example of a much more general statement, namely, these formulae 
define a Poisson bracket on g whenever (• , •} is symmetric and ad- invariant and R 
has properties (ii) and (iii). The only axiom of a Poisson bracket that is not 
immediate from the definition is the Jacobi identity. Unfortunately, checking this 
is consumes more space than we can justify here. Details can be found in |17l §4] , 
but readers should not be discouraged from simply doing it themselves. We hope 
that the following two hints make this more palatable: 

V{</>, ?P}\ B = \R{XB)Y - \R{YB)X + \XR{BY) - \YR{BX) (21) 

and given any trio of matrices A, B, and C, 

J2(R(B)A-R(A)B, R(C)) 

= J2H R ( R ( B ) A - R ( A ) B )> C) + \{R{B)R(A)-R(A)R(B), C) 
= i£<L4,5], C) 

where all sums are over cyclic permutations of (A, B,C). □ 

The Poisson bracket given in the proposition is often referred to as the 'quadratic 
bracket' because the point B G g where it is evaluated appears quadratically. The 
Lie- Poisson (or Kirillov) bracket associated to the Lie algebra (g, [,] R ) is linear 
in B: 

{>>Lp| b = 5<[^L,5}. 

This bracket is known to be compatible (in the sense of Magri-Lenard) with the 
bracket above; however, it is not pertinent to the study of CMV matrices. It is 
relevant to Jacobi matrices: under the embedding J t— > i J, the manifolds of Jacobi 
matrices with fixed trace are symplectic leaves. This is just the usual construction 
in G/(n,K) in disguise. 

Lemma 4.3. The Hamiltonian vector field on g associated to <f> : g — > K is 

B = | [BR(XB) - R{BX)B) (22) 
where X — \J<j). Equivalently, 

B = Bnt(XB) - ^{BX)B = n a (BX)B - Bn a {XB). (23) 
In particular, if B is unitary, then 

B = —Bir a (B~ 1 LB) (24) 

where L = n\{BX). 
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Proof. In terms of the Poisson bracket, the defining property of the Hamiltonian 
vector field is {4>,ip} = tp = (B,Vip). Equation (|22[l now follows by cycling the 
trace: if Y = Vtp, then 

{</), ^} - ±(R(XB),YB) - \(R(BX) 1 BY) (25) 

= \{BR{XB) - R(BX)B, Y). (26) 

From the definition, R = Id— 2n a = 2%i — Id. Equation (|23[l follows by substi- 
tuting these relations into l|22l) . 

To obtain the special case, let us write BX = L + A. As A and B~ x AB are 
anti-Hermitian, ir a (A)B = Bir a (B^ 1 AB). This implies 

B = n a (L + A)B - Bn a (B- 1 (L + A)B) = -BiT a (B- 1 LB) , (27) 

which is exactly (|24|l . □ 

Proposition 4.4. The symplectic leaf passing through a particular CMV matrix 
contains only CMV matrices and all have the same determinant. 

Proof. Proving that the determinant is a Casimir is easy, so let us start there. 
Given 9, let 4>e(B) = lm[e l ° logdet(S)] on an open neighbourhood of C. Then 
Vcf> e = e l6 B- x and so R(BX7(f> e ) = i?(V^ S) = er id Id. Therefore, 

= |(e- ie Id,W-B) - i(e- ie Id,£VV>) = f Imtr([W,£]) - (28) 

for any function ip and any angle 9. 

Let C be a unitary matrix and 4> '■ — * K. From (|24(l in Lemma f4. 31 we see that 
there is an anti-hermitian matrix A so that under the Hamiltonian flow generated 
by 0, C = —CA. Thus C remains unitary. 

Now let us restrict our attention to the case of C a CMV matrix. We will show 
that under the </>-flow, C remains in CMV shape, which implies that C remains a 
CMV matrix by Corollary E3 

By (|23|) from Lemma f4. 31 there are L,L' 6 [ so that 

C = L'C- CL. (29) 

Elementary calculations show that (left or right) multiplication by a lower trian- 
gular matrix does not change entries above the upper staircase; they remain zero. 
Moreover, exposed entries are simply multiplied by the corresponding diagonal en- 
try in the lower triangular matrix. The first fact shows that C vanishes above the 
upper staircase of C. The second shows that the exposed entries in the upper stair- 
case obey an equation of the form Cij — j(t)Cij for some real- valued function 7; 
therefore, they remain positive. 

This reasoning can be transferred to the lower staircase by noting that 

fit C f = -C+CC* = C ] L' - LCl (30) 

This completes the proof that C remains in CMV shape and hence, of the propo- 
sition. □ 

It remains for us to show that the symplectic leaf actually fills out the set of 
CMV matrices with fixed determinant. As this manifold has dimension 2(n — 1), 
one solution to this problem would be to find n — 1 functions that Poisson commute 
and have linearly independent Hamiltonian vector fields. While appearing round- 
about, this approach is actually rather efficient for us because there is just such a 
family of Hamiltonians that we wish to study anyway, namely, those of the form 
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B i— > Imtr{/(_B)}, for some polynomial /. Our interest stems from their relevance 
to the Ablowitz-Ladik hierarchy. We begin with the simplest abstract properties of 
these Hamiltonians. 

Proposition 4.5. Given a polynomial f, let (f> : g — > K by 

<f>(B) = Imtr{/(B)}. (31) 

(i) V<t>\ B = f{B). 

(ii) Functions of the type <|31|) Poisson commute. 
(hi) Under the flow generated by <f>, 

B=[B, \R(Bf'(B))] = [B, n(Bf'(B))) = -[B, n a (Bf'(B))]. (32) 

(iv) There is a unique factorization 

exp{tBf'(B)} =L{t)Q-\t) (33) 

with L(t) E h(n) and Q(t) e U(n). 

(v) The integral curve B{t) with B(0) = B is 

B{t) = L-\t)BL(t) = Q- l {t)BQ{t). (34) 

Proof. Part (i) follows for monomials by cycling the trace. This extends to poly- 
nomials by linearity. 

Part (ii) is readily deduced from part (i): if ip(B) = lmtr{g(B)}, then [B, V<j)] = 
[B, V-0] = and so {<fi, ip} = follows immediately from the definition, (|19|) . 

The first inequality in (|32|l follows immediately from (|22|l and the fact that 
[B, V0] = 0. The second and third equalities follow from (|23|l . 

After taking the adjoint of both sides, H33JI becomes the QR factorization. 

We will prove part (v) by direct computation. As 134(1 is certainly true when 
t = 0, it suffices to check that all three formulae obey the same differential equation. 
We will just prove this for Bi(t) = L- 1 {t)BL(t) as Q- 1 (t)BQ(t) can be treated 
identically. From the definition, 

Bx(t) — L~ 1 (t)BL(t) - L~ 1 (t)L(t)L~ 1 (t)BL(t) = [Bi(t), L^ 1 (t)L(tj\ 

so by (|32|l . it suffices to show that for every t, tti (J5i/' '(-Bi)) = L^ 1 L. This can be 
demonstrated by differentiating H33fl : 

Bf'{B) aip{tBf'(B)} = d t vip{tBf'(B)} = LQ' 1 - LQ^QQ- 1 , (35) 

which implies Bf'(B)LQ- 1 = LQ- 1 - LQ^QQ" 1 and so 

Bxf(B x ) = L~ 1 Bf'(B)L = L- x L{t) - Q^Q. (36) 

This shows not only that n(Bif'(Bi)) = L- 1 L, but also w a (Bif (Bi)) = -Q^Q. 

□ 

Part (iii) of this proposition gives Lax pair representations for the flows generated 
by Hamiltonians of this form. From the abstract theory, this is to be expected: B ^> 
Imtr{/(i?)} are central functions on g — that is, they are constant on conjugacy 
classes — cf. [57| §2.12.4]. It is not difficult to see that these Lax pairs are precisely 
those discovered in 

Of course, the key to reconciling the concrete work of Nenciu with the abstract 
approach described here is precisely the discovery that CMV matrices have a Lie 
theoretic interpretation. 
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Similarly, (|34(l can be deduced from the abstract theory. Because of the sparsity 
of the CMV matrix and the ready availability of good implementations of the QR 
algorithm, this provides an easy method for computational studies of the Ablowitz- 
Ladik hierarchy. Those familiar with the work of Deift, Li, Nanda, and Tomei 
[S| 111 j might immediately ask if we can say anything new about diagonalization 
algorithms for unitary matrices; this will be discussed at the end of Section 

We will continue our investigation of these Hamiltonians in Sectional By bor- 
rowing a result from there, we can prove the main result of this section: 

Theorem 4.6. The manifold of CMV matrices with fixed determinant form a sym- 
plectic leaf in the Poisson manifold Q. 

Proof. Let us write z\,...,z n for the eigenvalues of the CMV matrix C. By La- 
grange interpolation, we can find polynomials Fk so that Fk(0) — and Fk(zj) — 
5jk- It is then elementary to construct polynomials fk with F k {z) — 2zf' k {z). 
Now consider the following functions on q: 

c(> k (B)=Imtv(f k (B)). (37) 

By Corollary 16.41 <f>i, . . . , <j) n -i give rise to linearly independent Hamiltonian vec- 
tor fields. Moreover, by Proposition 14.51 these Hamiltonians Poisson commute. 
Therefore, the symplectic leaf passing through C must be of dimension no less than 
2n - 2. 

From Proposition ^. 41 we can deduce that the dimension must be exactly 2n — 2. 
As the space of CMV matrices with fixed determinant is path connected (it is home- 
omorphic to ID" -1 ), a simple chain-of-balls argument shows that the symplectic leaf 
must exhaust this manifold. □ 

In the remainder of this section, we will describe dressing transformations and 
determine how they act on CMV matrices. 

We need to consider three groups G = GL(n,C), D = G x G, and Gr = 
L(n) x U r (n). By U r (n) we mean the group ofnx« unitary matrices with the 
order of multiplication reversed: lo\ * UJ2 — u)2<jO\. (We will use lower-case Greek 
letters for group elements.) 

The significance of the group Gr is that the corresponding Lie algebra is g with 
the bracket given by l|18[l. It is also worth noting that the map (A,w) i— > Xuj^ 1 
defines a diffeomorphism from Gr to G. The inverse mapping is given by (3 <— > 
(/?+,/?_) where [3 + £ h(n) and (3- S U(n) are the solution of the factorization 
problem (3 = (3+(3Z ■ This problem is always uniquely soluble; indeed, it is essen- 
tially the QR factorization of /r . 

Both G and Gr can be regarded as subgroups (and submanifolds) of D via the 
following embeddings: 

i:G^D by >-> (J3,j3), and 
i':Gr^D by (A^)^(A,^- 1 ). 

Moreover, the product map 



*•*' :GxG R ^D by (P,X,cj) ^ {PX^UJ- 1 ) 
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defines a diffeomorphism (as does i' ■ i). The inverse mapping is easily seen to be 
7r © 7r' where 

tt'-.D^Gr by K,»y)->([(r 1 '7)+]- 1 .(r 1 '7)-), and 

n-.D^G by ~£(rS)+ = ?y(CS)- 

These projections also permit us to identify the coset spaces D/Gr and G\-D with 
G and G_r respectively. 

The group D acts by right multiplication on D/Gr and hence, by the identifi- 
cation just noted, on G. As the product map i ■ i' is onto, we can get a complete 
understanding of this group action by studying its restriction to the subgroups i(G) 
and i'{Gr). 

The first subgroup, i(G), leads to nothing new, just the action of G on itself by 
left multiplication; however i' (Gr) is different, it leads to dressing transformations: 
(A, ui) £ Gr maps G to itself via 

[3 h+ 7t(A/3,lj _1 /3) = A/3(/3- 1 A- 1 w- 1 /3)+ = oj- 1 ^ 1 ^ 1 ^ 1 ^)-. (38) 

Theorem 4.7. The orbit of a CMV matrix under the dressing transformations 
(|38[1 is precisely the set of CMV matrices with the same determinant. 

Proof. As we have already proved Theorem l4.6l this follows from the general theory 
of Poisson Lie groups (cf. [23 §2.12]). However, as it is possible to give a quick 
and concrete proof that the orbit coincides with the symplectic leaf, we do so. 
When is unitary, the formula for the action can be simplified considerably: 

(/ru- 1 ^- 1 /?)- = /r^/r^- 1 )- and so 

(A,w) :/3h+ (/r^ -1 )-. (39) 

In particular, u> plays no role. This formula also shows immediately that the orbit 
of a unitary matrix contains only unitary matrices. 

As the group Gr is connected, we can prove the theorem by showing that the 
tangent vectors to the orbit coincide with those of the symplectic leaf. 

Writing (|39|) with A = e~ tL for some L £ I and using the fact that (3 is unitary 
shows 

i ► (/3" 1 e ti )_ = /3(/?- 1 e ti /3)_ = j3 - t[3-K a {[3- l Lj3) + 0(t 2 ). (40) 

Thus, the tangent space to the orbit through (3 is given by —f3n a ([3~ 1 L[3) as L 
varies over [. This is exactly the tangent space to the symplectic leaf as given in 
igUl of Lemma PI □ 

5. The Ablowitz-Ladik Bracket 

As discussed earlier, the n-tuple (ao, . . . , a„_2, a n -i) G D n_1 x S 1 of Verblunsky 
coefficients gives a system of coordinates for the manifold of CMV matrices. 

As shown in Section the determinant, det(C) = (— l) n_1 a„_i is a Casimir 
and so (ao, ■ ■ ■ , ctn-2) can used as coordinates for the symplectic manifold of CMV 
matrices with fixed determinant. It is therefore natural to write the Gelfand-Dikij 
bracket in these coordinates. By doing so, we will recover a bracket introduced 
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earlier for the study of the Ablowitz-Lakik system: 

n-2 

{/,s} 2 = 5>? 

n-2 
3=0 

where ay = Uj + i«j for all < j < n — 2, and, as usual, 

— --(— -i— \ and —-If— l —\ 

da 2 \du dv J da 2 \du dv J 

For clarity, we will call this second bracket the Ablowitz-Ladik bracket. Note that 
this differs by a factor of two from that used in we will adjust results quoted 
from this paper accordingly. 

On the set of Verblunsky coefficients (ao, ■ ■ ■ , a n ~2) £ B" _1 (with fixed a n -i £ 
S 1 ) we consider the Hamiltonians He(K m ) and lm(K m ), where 

K m = M,{C m ) 

for m > 1. The evolutions under the flows generated by the real and imaginary 
parts of K m in the Ablowitz-Ladik bracket were found by Nenciu |26|: 

C = {Re(K m ),C} 2 = -[C, ir*(iC m )}, (43) 

C = {lm(K m ),C} 2 = -[C, 7T a (C m )}- (44) 

To obtain the formulae given above from those of one should account for the 
difference in notation. In |26| . a subscript + is used to indicate 

{Bij :i<3 
\B« :i = j 
:i>j 

from which it follows that B + — {B + y = Tr a (B); indeed both sides are antisymmetric 
and agree on the upper triangle. 

These formulae also hold true if we replace the Ablowitz-Ladik bracket by the 
Gelfand-Dikij bracket (|19|l , as one can immediately see from the second identity in 
(I23|l . This suggests that the Ablowitz-Ladik bracket might be the one we seek. 

We will indeed prove that the Gelfand-Dikij bracket defined in Proposition ^. 21 
agrees, in the a coordinates, with that given above. 

Proposition 5.1. Consider the functions 4>j(C) = ^[i a Cjj] where 1 < j < n and 
a G {0, 1}. The Gelfand-Dikij brackets among these functions are the following: 

= -{#-1,$} = Im^C^C^} (45) 
= -^_ 2 Im[i 6+0 a J _ 3 a,-i] ) (46) 

for all choices of a, b and j, 

= - |C,+i,,f - fe-il 2 = PU -P?_ 2) (47) 

and all other pairs Poisson commute. Following the standard conventions, a_i = 
— 1, jO_i = 0, and matrix entries with indices laying outside the bounds of the matrix 
are zero. 



df_dg_ _ df dg 

duj dvj dvj duj 



(41) 



df dg df dg 



daj daj daj da.j 



(42) 
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Proof. By definition, = i a Ejj where Ejj is the elementary matrix with a 1 in 



the position. Thus, 

{<p],4} = h{ R ^ aE 3 3 C),i b E kk C) - ±{R(i a CE 00 ),i b CE kk ) (48) 

= lmi b (e k \Cir { (i a E 3J C)e k ) - lmi b (e k \iri{i a CE J3 )Ce k ) (49) 

= -lmt b (e k \CTT a (t a E JJ C)e k ) + Imt b (e k \ir a (i a CE 33 )Ce k ) (50) 



where the last two lines follow from R = 2iri — Id = Id — 2ir a and by observing that 
the identity terms cancel. The proof proceeds differently depending on the parity 
of j. 

Fix j, even. Then both A = TT a (i a EjjC) and B = TT a (i a CEjj) have only three 
non-zero entries: 

A j,j -hmi-'C,.,!. A jj+1 = -Aj+ij = re,., . i 

Bj tj i lnii /"'(',.,;.. i j = I'. j : = i a Cj-i,j 

Thus by (|50|l . the bracket can only be non-zero for k £ {j — l,j,j + 1}. Moreover, 
if k = j — 1 then only the first summand in (|50|l contributes, while for k = j + 1, 
only the second summand contributes. Multiplying things out gives and 
Equation l|4()|) and the last formula in l|47|) follow from the expressions for the matrix 
entries. 

We now treat j odd using l(4*9"|) . Both L = -Ki(i a EjjC) and M = ir\(i a CEjj) have 
only four non-zero entries: 

Ljj = '"(', ,;. £jj-i = : • L j+i,3 = * aC jJ+i> ^j+2,j = « a Cj, J+ 2, 

M W = i:c;/"C,. ; ;. M i+ ij = i a C j+hj , Mjj-i = Mj,j-2 = ¥C~^. 

Thus the first summand in l|49|l is only non-zero for fc e {j — 1, j'}, while the second 
summand contributes only for S + If k — j + 1 or j — 1 this leads quickly 
to the formulae given in the proposition. When k = j it is a little more involved: 

= Im(^[|C JJ+1 | 2 + |C„- +2 | 2 - IC^f - |C,-_ 2J | 2 ]) 

= Im(i b -»[-|C iJ _ 1 | 2 + |C,- +1J | 2 ]) 

where we used the fact that the j row and column of C are unit vectors. Taking 
a = and 6=1 gives (gZJ. □ 

To simplify our calculations, we extend the Poisson brackets defined above to be 
bilinear over C. Note that {<fi,ip} is then the complex conjugate of {(f), ip} and so 
the values of {4>,ip} and {(f), tp} suffice to determine all Poisson brackets between 
the real and imaginary parts of 4> and if). 

In the previous proposition, we calculated the Gelfand-Dikij brackets of the real 
and imaginary parts of the diagonal entries. After a little computation, we obtain 
the following equivalent information: 

{C ji ,C ij } = 2i(p|_ 1 -p|_ 2 ) ) (51) 

I I = -2iPj-2®j-30ij-i, (52) 

I,; \ = 0, (53) 

and Cjj commutes with all other C kk and C kk . These equations are the key to 
showing that the two brackets are the same: 



CMV: THE UNITARY ANALOGUE OF JACOBI MATRICES 



19 



Theorem 5.2. For any < k, I < n—2 the Gelfand-Dikij brackets of the Verblunsky 
coefficients are given by 

{a fc ,a/} = and {a k ,ai} = -2iS k ip 2 k . (54) 

That is, the Gelfand-Dikij and Ablowitz-Ladik brackets agree. 

Proof. We will prove fK4^l by induction on k + l. As di = ao, equation (fBTJl settles 
the case k = I = 0. Using the k <-> I antisymmetry, we divide the inductive step 
into three cases: 

The case I = k: As Cfc+i.fc+i = — ct k -ia k , equation (|51() implies 

-2i\a k fp 2 k _ x + |a fc _i| 2 {afc,a fc } = 2i(p 2 k - p\_ x ), 

which simplifies to {a k ,a k } — ~2ip k , because a k -\ is non-zero on a dense set. 

The case I > k + 1: As Cfc+i.fc+i = — a k -ia k , with the usual convention a_i = 
-1, 

= {Cfc+i.fc+i7<w+i,2+i} = cik-ioti-i{a k , ai}. 
Similarly {C fc +i ifc +i,C ; +ij+i} = implies {a fe ,a/} = 0. 

The case I = k+l: By l|53() . we have {C/c+i.t+i, C/c+2,fc+2} = from which we 
deduce {a k ,a k+ \} = as above. Finally, by lt5^|) with j = k + 2, 

2ip 2 k a k _ 1 a k+1 = {a k -ia k , a k a k+1 } = 2ia k - 1 a k+1 p 2 k + a k _ia k {a k , a k+1 }. 

Cancelling gives {a k , a k +i} = 0. □ 

Remark 5.3. The evolution of the Verblunsky coefficients in the Ablowitz-Ladik 
bracket under the flow generated by 2Re(ii'i) is the Ablowitz-Ladik evolution (see 
Q] and 0): 

{ctj, 2Re(Ki)} 2 = ip 2 (a.j-i + ay+i). 

The proof of Theorem 15.21 was based around the computation of the Gelfand- 
Dikij brackets of diagonal entries. While it is possible to directly compute the 
Gelfand-Dikij bracket of any pair of entries of a CMV matrix, even the formulae 
for the Hamiltonian vector fields are rather unpleasant. However, in the particular 
case of exposed entries, things are not too painful: 

Lemma 5.4. Fix k,l so that (k,l) is an exposed entry for CMV matrices. The 
evolution of CMV matrices under the Hamiltonian r k i(B) — Ke(B k i) is given by 

iC={-l) k r kl (C)[E kk C-CE u ]. (55) 

Proof. We will use the notation E k i for the matrix with 1 in the (fc, I) position and 
zeros everywhere else. 

As Vrfe; = iEi k , equation l|23|l implies 

C = Cnt(iEi k C) - ^{iCEi k )C. (56) 

We proceed differently depending on the parity of k. 

As noted in Definition 11.41 the exposed entries with k odd all take the form 
(2j — 1, 2j + l) with j > 1. In this case, both Ei k C and CE\ k are lower triangular and 
the sole entry on the diagonal is positive (its value is Cm). Thus we are interested 
in the value of tti(B) where B is lower triangular with purely imaginary diagonal. 
It is easy to see that in this case, tt[(B) = B — D, where D is the diagonal part of 
B. Consequently, reduces to 

C = —iC k i [CEu — E kk C] 
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in agreement with Ij55(l . 

For k even, iEikC and iCEik are upper triangular with purely imaginary diagonal. 
For matrices B of this form, iri(B) — + D, where D is again the diagonal 
part of B. As C^C = CC^ = Id for CMV matrices, only the contribution from D 
survives. The fact that this now appears with a plus sign results in the (— l) fe factor 
in {55). □ 



6. Ablowitz-Ladik: Evolution of the Spectral Measure. 

Shortly after the discovery of a Lax pair representation for the Toda Lattice ^3 1 
Moser gave a complete solution for the finite system, [21] . Specifically, he discovered 
the angle variables associated to the actions of Henon ^H] and Flaschka In 
addition, he studied the long-time asymptotics and determined the scattering map. 

In this section, we will discuss some corresponding results for the Ablowitz-Ladik 
system. The remaining parts of the analogue of Moser's solution can be found in 
Section0 A special case of Corollarv l6 . 51 has already appeared, [25] • The approach 
used there was to begin with a special case of H7()|l and determine the induced 
evolution on the Verblunsky coefficients. 

(To be precise, Moser did not check that his 'angles' Poisson commute. This 
follows as a special case of ^| Theorem 1].) 

The key observation of Moser was that the spectral measure associated to the 
Jacobi matrix and the vector e\ has a very simple evolution. We will show that 
the same is true for the Ablowitz-Ladik system. To do this, we need a few lemmas; 
those eager to see the result should skip ahead to Proposition ^ . 31 and its Corollaries. 

Lemma 6.1. For any matrices B and C, 

R(B^) = R{B) + B - B^ (57) 
R(BP) = BP - P{B - B^P (58) 
R{[C, P}) = CP + PC- 2C f P - 2P(C - C f )P (59) 

where P is the rank-one projection |ei)(ei|. 

Proof. The first result is very simple: B — B^ is anti-Hermitian and so R(B — B') = 
-B + B^. The result now follows from the fact that R is linear over R. 

Next, notice that -B|ei)(ei| has non-zero entries only in the first column. There- 
fore, the anti-hermitian part is just \P{B — B^)P. Equation (|58|l now follows easily 
from (H2J). 

The value of R{PC) can be deduced from (|58l) by first applying 157|) with B = 
C^P. The result is R{PC) = 2C^P-PC + F(C*-C* t )F. Combining this with {5HJ 
gives {SHI). □ 

Lemma 6.2. Given a polynomial g, let P = |ei)(ei| and tf> : g — » R by 

iP(B) = Re (e 1 \g(B)e 1 ) = (ig{B), P) (60) 
then [B, Vip] = i[g(B),P] and so 

R([B, VV>]) = ig{B)P + iPg(B) + 2ig(B)^P - U${B)P. (61) 
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Proof. As the operations involved are linear over K, it suffices to treat the case 
g(B) = i l B k . By cycling the trace, 



fe-i 

Vip =i l+1 ^2B q PB k - q - 1 . 

9=0 



Thus, the sum in [B,Vtjj] telescopes to give i l+1 [B k , P}. Lastly, Q61(l follows from 
this and Lemma It). II □ 

Proposition 6.3. Given a polynomials f and g, let us define maps from g to R by 

<j>{B) = Imtr{/(B)} and ip(B) = Re (e 1 \g(B)e 1 ). 

Taking (f> as the Hamiltonian, 

iiB) = Re (e 1 \Bf'(B)(g(B) + g(B)*) ei ) - 2^{B) Re (e 1 \Bf(B)e 1 ) (62) 

In particular, if d/i is the spectral measure associated to e\ and a CMV matrix C, 
then writing F(z) — 2Hezf'(z), we have 

d t J Gd^L — J F G d[i - J F d^i J Gd^i (63) 

for any function G : S 1 — > M. 

Proof. From Proposition 14.51 we have 

i>{B) = R(Bf(B))],Vi>) = -%(R(Bf'(B)), [B,V^]) (64) 

= \(Bf\B),R([B,V4,])). (65) 
Now we can apply l|61|) from Lemma fo. 21 to deduce 

i>(B) = (iBf'(B)(g(B)+g(B)l), P) - 2i/>(B)(iBf'(B), P), (66) 
which says the same thing as (|62|) . □ 

Corollary 6.4. Given a pair of polynomials f\ and fi, the Hamiltonians 4>j(B) = 
Imtr{/j(i?)} give rise to the same integral curve through the CMV matrix C if and 
only if 2i-» Re[z/{(z) — zf^z)] is constant on the spectrum of C. 

Proof. From Proposition 14.41 we know that both integral curves remain in the set 
of CMV matrices. As a CMV matrix is entirely determined by its spectral measure 
(cf. Theorem 12. 31) . it suffices to understand whether the two Hamiltonians give rise 
to the same evolution for this measure. From l|63|l . the answer is clear: the integral 
curves are the same if and only if 

J [Fx - F 2 ] Gdfi = J [Fx - F 2 ] dfi Jg d{i (67) 

for all functions G : S 1 — > K. That is, if and only if Fx — F% agrees with a constant 
d/i-almost everywhere. Lastly, the support of dfj, is precisely the spectrum of C. □ 

Since the vector ex is cyclic for any n x n CMV matrix C, the associated spectral 
measure /i is supported at n points. Conversely, if /i is a probability measure on 
the circle which is supported at n points, then by the results in Section [21 it is the 
spectral measure for (C, ex), with C the CMV matrix representing multiplication by 
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z in L?{d[i). It is natural to parameterize the spectral measure, g?/i, in terms of the 
eigenvalues and the mass dfi gives to them: 

~ n 

f{z)dn{z) = Y,f^ 6i )H- (68) 

3=1 

By the observations of the previous paragraph, 6j and \Xj are well-defined on the 
manifold of CMV matrices. 

By Theorem 12 .31 one may view 6i, . . . , (A\, . . . , (J> n —i as a system of coordi- 
nates on the manifold of CMV matrices with fixed determinant. From part (ii) of 
Proposition ^. 5l we know that the functions 9j Poisson commute. In particular, they 
do not change under Hamiltonians of the form 4>(B) = Imtr{/(i?)}. The evolution 
of Hj is easily determined from the proposition above. 

Corollary 6.5. Under the flow generated by 4>{B) = Imtr{/(£>)}, 

n 

d t log[ W ] = {0, log[ W ]} = F(e^ ) - ]T F(e ie >)m (69) 



i=i 

where F{z) — 2Rez/'(z). Consequently, 

_ cxp[F(e' g Qt]^-(0) 
W ~£exp[F(e^)i] M 0) ^ 

and for any j, I £ {1, . . . ,n — 1}, 

{6 U \\og[ N / l l n }}=5 jl . (71) 

Proof. Equation l|69|l amounts to nothing more that l|63|) when G(z) is the charac- 
teristic function of the set {e Rather than bother solving this system of ODEs 
directly, it is simple enough to check that (|7(J|) is indeed the solution. 

Consider with f{z) = z k and f(z) — iz k for k = 1,2, Combining 
pairs gives 

J2 ike Ml {e u log^]} = 2ik[e ike ' ~ J2 

This is a system of equations for {9i, log[fij]} that has a unique solution (the Van- 
dermonde determinant does not vanish): 

{0 z ,log[/^]}=2Sy-2 W . (72) 
Subtraction now gives (|7T|) . □ 



Equation 1)71(1 gives an obvious candidate for a system of canonically conjugate 
(or Darboux) coordinates: 

6i, . . . ,0„_i, |log[/ii//x n ], . . . , |log[/i„_i/At n ]. 

Moreover, the natural analogues for the Toda lattice are canonically conjugate, jl()l 
Theorem 1]. In Section |H1 we will show that the variables log[/Xfc//Lt„] do not com- 
mute and use the values of their brackets to find a system of Darboux coordinates. 



CMV: THE UNITARY ANALOGUE OF JACOBI MATRICES 



2:S 



7. Ablowitz-Ladik: Asymptotics and Scattering. 

In this section, we study the asymptotics of solutions to the equations in the 
Ablowitz-Ladik hierarchy. These asymptotics will be expressed both in terms of 
the spectral measure dfj, and the Verblunsky coefficients. In the former case, the 
answer is an immediate corollary of the results in the previous section. 

Let us fix a Hamiltonian from the Ablowitz-Ladik hierarchy: 

cj>(C) =Imtr (/(C)), (73) 

where / is a polynomial. As in the previous section, we will use the shorthand 
F(z) = 2Re[zf'(z)]. 

We will write the spectral measure as 

n 

f(z)d f x(z) = ^2f(z k ) f x k . (74) 

k=l 

Naturally, C, the spectral measure, and the Verblunsky coefficients evolve under 
the flow generated by </>; we use C(t), /ik{t), and (Xj(t) to denote these quantities at 
time t. By Proposition 14.51 part (ii), the eigenvalues do not change under this flow; 
hence there is no need consider Zf. (t) . When we omit the time dependence, we refer 
to the initial data. 

In defining fj, k and z k , we can choose any ordering we please; however, there is a 
particular condition on this choice that simplifies the formulae below. Namely, we 
require that 

Ai>A 2 >--->A„, (75) 

where we use the shorthand A& = F(zk) — 2 Re[zkf {zk)]- Of course generically, the 
ordering will be strict. Note that by the continuity of F this labeling is well-defined 
on an open set which is invariant under the flow of </>. 

Proposition 7.1. Under the flow generated by the Hamiltonian (f>, the masses have 
the following asymptotics 



log[/i fc (t)] = -(Ax-Afe^ + log 



/'-A: 



Mi H h fii 



0(e~ at ) (76) 



as t — > oo. Here v is defined by \\ = ■ ■ • = A„ > A^+i and a = Ai — A„ + i > 0. In 
particular if k > v , then [i k — > exponentially fast. 

Proof. Equation (|76() is an immediate consequence of (|70|l , which says 

log[Mfc(*)] = Afet + log[/i fc ] - log 
The definition of v implies 

n 

^ eAmt = eAlt (Mi + • • • + M.)[l + 0(e- at )}. 
Combining these two formulae completes the proof. □ 



i=i 



We now turn to studying the asymptotics of the Verblunsky coefficients. In 
order to simplify the formulae that follow, we will use multi-index notation. All 
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our multi- indices will be ordered. Given a multi- index I = (i± < ■ ■ ■ < i{) of length 
I, we write 



m/ = n ^* - 



zi 



n 

fc=i 



and we abbreviate Vandermonde determinants as follows: 

A(Ci,...,C m )-det(Cf- 1 )= [] [Cfc — Cf 

l<j <k<m 

A/ = A(z 2l , . . . 



(77) 



(78) 
(79) 



We follow the natural convention that the Vandermonde determinant of a single 
number is 1. 

The key to converting asymptotic information on the measure into asymptotics 
for the Verblunsky coefficients is the following lemma. 

Lemma 7.2. Let dji = Y], _, /ifc<5 Zfc be a discrete measure on the unit circle and 
let ft{p) — J z p dfi(z). Then for each 1 < m < n, 



m-2 



det[A(fc-0] =]Tl A 'lV/= II 



j=0 



2{m-]-l) 



m-2 



(80) 



det[A(fc - I - 1)] = Yl = (-ir~W-i n P? Cro_i_1) , (81) 

/ 0=0 

where the determinants are taken over 1 < fc, I < m and both sums are over ordered 
multi-indices I = (1 < i\ < ii < ■ ■ ■ < i m < n). 

Proof. The proof is an elementary application of a special case of the Cauchy-Binet 
Formula (see ^3 p. 9] or |321 Lemma 36.2]): Suppose m < n. Given an m x n 
matrix A and a n x n diagonal matrix D, 



f Ton 

det(ADA^) = \ l det (^MJ| II D v.v f ■ 

ii<---<i m r=l ' 

To prove the first equalities in (18011 and (|81|) . we choose 



(82) 



1 

zi 



1 

Z2 



m— 1 m — 1 



v m — 1 



with D — diag[/zi, . . . , /i n ] and D = diag[/iizi, . . . , fJ, n Zn\, respectively. The right- 
hand side of l|82|) reduces to the form given above because of l|78|) . Evaluating the 
matrix products ADA^ leads to the Toeplitz matrices given on the left-hand sides 
of JHDl and JHD- 

To prove the second equality in H8()|) . we apply row operations to A: 



$o(zi) 



EA 



rn — 1 



(*0 



m— 1 



(Zn)_ 
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for some lower-triangluar matrix E of determinant one. The orthogonality of &j 
with respect to d\i shows that EADA^E^ is a diagonal matrix with entries H^H 2 - 
The product of the squared norms of reduces to (|8U|) via . 

The second part of (|%T|) follows from (|5U|> by the Heine Formula, [2H1 Theo- 
rem 1.5.11], which says (— l) m_1 a m _i is the ratio of the left-hand sides of i|8U|) 
and JHU). □ 

Theorem 7.3. Fix 1 < k < n — 1. Let B k = {I : A; = Afc}, let s(k) be the largest 
index not contained in B k . Then, as t — > oo, 

a k -i{t) -> [-1) zj , (83) 

2^ \&juiF(J>i 

where J = (1 < 2 < • • • < s(fc)) and &oi/i sums are over all ordered multi-indices 
I C _B fe o/ length k — s(k). In particular, |aifc_i| — ► 1 if and only if Afc > Xk+i- 
If all Xj are distinct, 

a fc _i(t) = (-I)*" 1 * ■~z k [l + a-ie- (Afc - Afc+l)t + 0(e-T*)] (84) 

w/iere 



fc-i 



£ fe _i = (z fe z fe+ i - TT 



2fe+i - 



(85) 



and 7 > (A fe - A fe+1 ) > 0. 

Proof. By taking the ratio of l|80|) and Q8ip. we obtain 

afc - lW - ( " 1} EIAlI^lW (86) 

where the sums are over all ordered multi-indices, L, of length fc. 

The key to proving i|83|) is determining the asymptotically dominant terms in 
these sums. By Proposition l7.il if X p > X q , then the ratio fi q (t) / n P (t) converges 
to zero exponentially fast. Therefore, the dominant terms arise from those multi- 
indices for which Aj t + ■ • • + Xi k is maximal. As the As are decreasing, the maximal 
value is Ai + • • • + Afe and the set of multi-indices that achieve this value is exactly 
the collection of J U I stated in the theorem. This shows that 

1 S l A Juj| 2 MJui (t)zjui , R7 , 

&k—l\t) ~ (, 1J ^ i A [2 TTn ) ( 8 <) 

L l A Ju/rA*,/u/(i) 

where A w _B means that the ratio A/_B — > 1 exponentially fast. (We will maintain 
this convention throughout of the proof.) Note that zj can be factored out, leaving 

Zj. 

By Proposition 

fJ-Jui(t) ~ -, — J 1 ^ 1 — exp[(Ai H h Afe)t - kXxt] 

for the multi-indices / under consideration. Substituting this into (|87() and can- 
celling common factors in the numerator and denominator gives 11831 . 

From i|83[) we see that the limiting value of (—l) k ^ 1 zjak-\{t) is a convex combi- 
nation of points on the unit circle. This sum contains exactly one term if and only 
if Afc > Ak+i. Therefore under this condition, \au-i(t)\ — > 1. If the sum contains 
more than one term, then it contains two multi-indices that differ at only one index. 
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Thus the limiting value of a k -i(t) is a non-trivial convex combination of points on 
the unit circle and so lies strictly inside the unit disk. 

We will now consider the case of distinct Xj. This amounts to studying the 
asymptotics of fiL(t) for different choices of ordered multi-indices, L = (1% < ■ ■ ■ < 
Ik)- By Proposition 

fe 

logM*)] « - A t .) +log[ M i] - fclogM 

3=1 

and so the question reduces to finding the largest two values of A; x + • • • + A/ fc . 
Because Ai > A2 > • • • > A„, the largest value is Ai + • • • + X k ; the second largest 
is Ai + • • • + Ak-i + Xk+i- All other choices of L lead to strictly smaller values for 
the sum. We write 7 for the difference between the largest and third largest values 
of this sum. 

Returning to (|87|) and factoring out the dominant term in both the numerator 
and the denominator gives 

, fc _i . _l + z k z h+1 P k e-( x "- x ^ t + 0{e-^ t ) 



(-1) 1 zi---z k a k _ 1 



1 + p k e-(M-^ k +i)t + 0( e -7<) 
1 + (z k z k+1 - l)p fce -( Afc - Afc +^ t + 0(e-^), 



where 7 = min{7, 2(X k — A^+i)} and 



A(zi,...,z k -i,z k+ i) 



2 



A(zi, . . . , z k ) 

Cancelling common factors in the Vandermonde determinants leads to the formula 
given in the theorem. □ 

If all Xj are distinct, then viewed as a curve in the disk, a k -i(t) approaches the 
boundary in a fixed non-tangential direction. This simply amounts to the statement 
that £/c_i is non-zero and arg(^_ 1 ) = a,rg(z k z k+ i — 1) belongs to (— tt/2, n/2). Let 
us also note that the asymptotics of p k -\ are easily deduced from 18411 : 

p\_ x {t) = -2Re(a-i)e" (A - At+l)i + 0(e^*) (88) 



\z k +i — z k \ 



fc-1 



^ 1=1 



z k +i - Zl 



Zk - Zl 



2 



-(\ k -\ k+1 )t 1 cilp-it 



0(e^ f ). (89) 



This shows that the factors C(t) and A4(t) of the CMV matrix C(t) diagonalize 
as t — > 00 and hence so does C(t). Moreover, the eigenvalues are ordered by the 
corresponding value of F(z). This is a well known phenomenon for the Toda Lattice. 

When the Xj are not all distinct, C{t) converges to a direct sum of CMV matrices 
and their adjoints. Specifically, if 

A/t-i > Afe = • • • = X k +m > Afe+ m +i, 

then ot k -\, . . . , ak+m-2 do not approach the boundary and C(oo) has a non-trivial 
block of size m beginning at row/column k. If k is odd, this will be a CMV matrix; 
if k is even, it will the adjoint of a CMV matrix. 

While this phenomenon cannot occur for the Toda Lattice, it can occur for 
Hamiltonians in the same hierarchy. Some examples of non-diagonalization are 
discussed on page 389 of 0. However, we do not know if anyone has troubled to 
write down the analogue of Theorem l7.3l 
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We now turn our attention to scattering; we will only consider the case of distinct 
As. Thus far, we have only discussed the behaviour as t — ► +00; however the 
behaviour for t — > — 00 can easily be deduced from this by reversing the sign of the 
Hamiltonian. This results in the following changes to the t — > 00 formulae: 

Afc 1 * — A„_fc + i, Zk 1— > z„_fc + i, and /z^ 1— > /i n _fc + i 

In particular, 

ak_i(f) = (-l) fe_1 £„ ■ • • z n - k +i 1 + Cfe-ie 
as t — > — 00 where 

Cfc-l = ( z n-fe+l^n-fe 



0(e 



7* > 



^ f^n — k 



n 

Z=n-fc+2 



Zl 



Zn-k+1 



Zl 



(90) 



and 7 > (A„_ fc - A„_ fc+ i) > 0. 

The scattering map is the transformation that links the asymptotics of solutions 
as t — > — 00 to those as t — > 00. The result follows easily from the formulae given 
above: 

Proposition 7.4. Suppose all Xj are distinct, then 

a fe _i(+oo)a n _ fc _i(-oo) = (-l)"^i • • • z n = (-l)"det(C t ) = -a„_i, 
which is independent oft. Furthermore, 



k-l 



£fc-lCn-fc-l — l z fc+l — z k\ 2 Y[ 



1=1 



Zk+l — Zl 



Zk - Zl 



n 



=k+2 



Zk 



Zk+l 



The eigenvalues Zj are time independent and can be recovered from the asymptotic 
values of the Verblunsky coefficients as follows: 

ay_ 2 (+oo) a n _ J -_i(— c») 



*j_i(+oo) 



x„_j(-oo) 



The corresponding result for the Toda Lattice can be found in [21 §4]. 

As noted earlier, the Toda flow diagonalizes Jacobi matrices. One may ask if it 
suggests a good algorithm for doing this. Indeed it does, namely the venerable QR 
algorithm. See [TT1[T3| . 

For unitary matrices, the naive QR algorithm is useless: the QR factorization 
of U is U ■ Id. However, by incorporating shifting and deflation one does obtain an 
algorithm that works. In view of parts (iv) and (v) of Proposition ^. 51 any algorithm 
based on the Ablowitz-Ladik flows will merely amount to the application of the 
QR method to a (possibly step dependent) function of U. We do not wish to be 
entirely pessimistic, particularly in light of the storage savings afforded by CMV 
matrices. One may try the following algorithm: Iterate 



C i-> QCQ ] where C±C f = LQ. 



The choice of sign may be fixed or vary. Whether this has any real numerical 
benefit, is a matter for experiment. 
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8. Canonical coordinates 

In this section, we extend Q\, ... , 9 n -\ to a system of Darboux coordinates; these 
Poisson commute by Proposition ^. 51 Based on (|71|l it is natural to imagine that 

\ log[/il/Mn], ■ • • , \ log[Mn-l/Mn] (91) 

are the missing variables as is true for Jacobi matrices (a special case of ^3 The- 
orem 1]). As we will see, they are not, because they do not Poisson commute. The 
remedy is well known (see |SJ Ch. 4]): the conjugate variables can be written in 
the form 

fi = |log[Mi/Ai«] +fl(®l,-- - ,0n-i) 
where fi are determined by the system of differential equations 

W k ~lk = {5 lo s[/WM!iog[/V Mn ]}. (92) 

This system is soluble via the Poincare lemma. 

We begin by calculating the right hand side of l|92|l . There doesn't seem to be 
any easy way to do this. Because of the permutation symmetry in the indices, it 
suffices to compute {log[^ 2 /MiL IosIaWmi] }• 

Lemma 8.1. For any enumeration of the eigenvalues e 1 ® 1 , . . . , e l9 ™ , the functions 

*i,fe(C) = {log\pj/fj,i},\og[iJ, k /fj,i}} 

Poisson commute with all Hamiltonians of the form Imtr/(C) and so remain con- 
stant under the corresponding flows. 

Proof. By the Leibnitz rule, it suffices to prove that V&j.fc commutes with 9i for each 
I = 1, . . . , n. For this case, we merely need to look to the Jacobi identity and {721 : 

- { { , logK/Mi] } . 0/ } 

= {{log|>k/M0j},log|>i/Mi]} + {{^logtMj/Mi^logE/ife/Mi]} 
= {-2<5 ;fc ,log[/^/^i]} + |2<5 ;i ,log[^ fc /^i]} 
= 0, 

which completes the proof. □ 

Lemma 8.2. Fix distinct points Cii • • • > Cn on the unit circle and consider the col- 
lection of CMV matrices with determinant Y[ Ck that lie within the e -neighbourhood 
of diag(Ci, • • • , Cn) as measured in the Hilbert-Schmidt norm. For e sufficiently 
small, there are analytic functions G and Gj of the 3(n — 1) variables ctk, otk, pk, 
< k < n — 2, so that 

Hi = 1 - pIG{ol,6l,p) (93) 

log[£] = log[pg] + logt ^^ ] + J2 Pi°i^ "< Pi ( 94 ) 
where fij is the mass associated to the eigenvalue nearest Q. 

Proof. Let D denote the diagonal of the CMV matrix and E, the remainder: C — 
D + E. We will also write Pj for the orthogonal projection onto the space spanned 
by 6j. These are, of course, the eigenprojections for D. 

As the Q are distinct, we can construct disjoint circular contours around each 
of these points. We orient these in the anti-clockwise direction and label them Fj. 
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For e sufficiently small, each contour encircles exactly one eigenvalue of C and 
the corresponding eigenprojection is given by 



(*-D-E)- i $Z = Pj 



(z-D^Eiz-D^M 



■j>&- D)- 1 E{z -D- E)- 1 E{z - D)~ 



1 dz 
2-rri ' 



The second equality follows from two applications of the resolvent identity. 

Now fij — tx{P\Pj) — (ei|Pj-ei) and so using the fact that Dn = oto, we obtain 

dz 



Mi = *u + £lf r (ei\E(z -D- E)~ l Ee\) 



ir, {z ~ ao) 2 

The term that is linear in E disappears because it has only a double pole when 
j = 1 and is analytic when j ^ 1. As Ee\ = poC2 and E^e\ — a\pQ&2 + PoPi^3, 

dz 



«ie 2 + Pie 3 |(z - D - J5) ^2)7 r^~- 

r (z-a ) 2 



(95) 



To see that the integral gives an analytic function of a, a, and p, we need merely 
write (z — D — E)^ 1 = XX Z ~ D)~ 1 [E(z — D)~ 1 ] 1 , which is norm convergent 
in a neighbourhood of Tj provided \\E\\ < e is sufficiently small. Taking j = 1 
immediately gives (I93[) . 

Elementary manipulations show that Ij94(l follows from 



2 Qi 

/ i2 = Po (a +a lQo ) 2 



(96) 



for some analytic functions Gj. To prove this, we simply need to apply the resolvent 
formula one more time. From (|95|l and D22 = —cxqcxi, 



P-2 



(a ai + a ) 2 



Pi 



r 2 



(«ie2 + pie^Kz — D — E) x Eei) dz 
2iri(z + a ai)(z - a ) 2 



As Ee2 = poaiei + pia2e3 + pi/92e4, an additional factor of po or pi can be extracted 
from the integral to give H96JI . □ 



Lemma 8.3. Under the hypotheses of Lemma \H. l 3. there exist analytic {unctions 
Hj of the 3(n — f ) variables otk, elk, pk> < k < n — 2, so that 



l0g[^] = l0g[p 2 p 2 ] + log 



^2pjHj(cx,a,p), 



(97) 



where ( consistent with the notation of Section \$l 

$2(2) = z 2 + (a ai - a )z - oti 
and p,j is the mass associated to the eigenvalue nearest Q . 
Proof. In view of (|93|l , it suffices to show that 



M3 = pIpI 



$ 2 (-aiQ!2) 2 



Y^PjHj{oL,a,p) 



(98) 



for some analytic functions Hj. This result does not follow by using the resolvent 
formula once more in the previous proof; instead, we will use a block decomposition. 
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Let us write C = [ ^ D \ E ] where 



A = 


a 


P0&1 


, B = 


P0P1 




Po 


-a a 1 _ 




-a pi 





Pl&2 P\P2 



and D + E is the splitting of the remaining block into its diagonal and off-diagonal 
parts. By the well known formulae for block matrix inversion, the top left 2x2 
block of (z — C) _1 is given by 

- A)- 1 + (z- A)~ 1 B{z — D —E — C(z - Ay 1 B]- l C{z - A)' 1 , 

which is relevant because as in the previous lemma, 

P3=2hf (ei\(z-C)- 1 e 1 )dz. 



For e sufficiently small, A has no eigenvalues inside T3 and so the first summand 
above does not contribute. 
To continue, we calculate 



(z-A)- 1 



1 



$20) 



z + «o«i Po&i 
Po z - a 



where det(z — A) = <& 2 (z) = z 2 + (a^ai — ag)z — ct\. This immediately implies that 



C{z-A)- 1 e 1 



PoPi 
$ 2 (z) 



[a 2 ei + p 2 e 2 \ 



and 



el(z-Ay 1 B 



zpoPi T 



Putting all these formulae together gives 

M3=PoP? / (e 1 \[z-D-E-C(z-A)- 1 B}- 1 (a 2 e 1+P2 e 2 ))^^ v . 

From here we proceed as usual: let E = E + C(z — A)~ 1 B and apply the resolvent 
identity (z — D — E)- 1 = (z - D)^ 1 + (z - D)- 1 E(z - D - E)- 1 in the equation 
above. As D\\ — C33 = —a \a 2 , the first term gives rise to 



pIp\&iI {e x \{z-D) 1 e x ) Z ^ Z 

Jr 3 2m^ 2 {zy 



2 2 
P0P1 



-ai«2 



& 2 (—aia 2 ) 2 ' 

which is exactly the dominant term in (|98|l . The remaining term can be rewritten 

pIpI <f (ei\E(z -D- £)- 1 (a 2 ei + p 2 e 2 )) . > , „ 

Jr 3 2tti(z + aia 2 )9 2 (zy 

which is of the form Po/°i X) PjHj(ot,a, p), because every entry of E contains at 
least one factor pj. □ 

Proposition 8.4. For any labelling of the eigenvalues, 

{log[ M2 /Mi],log[M3/Mi]} = 2cot(^)+2cot(^)+2cot(^ i ) (99) 
in the Gelfand-Dikij (or Ablowitz-Ladik) bracket. 
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Proof. We can choose a polynomial / so that F(zj) are all distinct and 

F{z x ) > F(z 2 ) > F(z 3 ) 

are the three largest values. (We remind the reader that F(z) — 2Rez/'(z).) By 
Lemma f8.ll {i \og[p 2 / pi], \ log[/X3//zi]} is conserved under the flow generated by 
4>{C) = Imtr/(C). The key idea is to use results from Section [7| to evaluate this 
bracket as t — > oo. 

For t sufficiently large, we can apply Lemmas 18 . 21 and fOl because C(t) converges 
to a diagonal matrix under the 0-flow. Thus we need only show that 



{ lo g[Po]+ lo g 



oi 



(a + a x aof 



-Ot-yOt\ 



,Pj H j} 



converges to the right-hand side of Ij99(l as t — > oo. To do this we employ linearity, 
the Leibnitz rule, and the following consequences of l|42|l : 

{log[p m ],p m /i(a m )} = {p m ,h(a m )} = -ip m h'(a m ) -> 

{log[pm\, p m h(a m )} = {pm,h(a m )} = ip m h'(a m ) — > 

{h(a m ),g(a m )} = -2ip 2 m ti (a m ) 7 g' (a m ) -> 

for any analytic functions h and g. Of the nine terms in the original bracket, this 
kills off all but two: 



|log[pg], log 



-aia 2 . 



<&2(— CtlQ!2) 



} + {log 



Oil 



(ato + aiao) 2 



log[PoPi]}- 



These can be computed using 1)32} . The result is 



-U 



aia 2 {aoai + ap ) r,„ a oai - «o 

<I>2(-Ckia2) 



JQi 



From Theorem 17. 31 

Q'O 

-aia 2 -> e^ 3 , 
as i — > oo. By using these formulae, 

{log[/i 2 /Mi]>log[M3/Mi]} = 4 *- 
The rest is trigonometry. 



-aoai 
#2(2) 



2i- 



ao«i + ap 



oi0i 



)(e* 



2i- 



□ 



While the proposition is stated with fixed indices (1,2,3), the arbitrariness of 
the labelling immediately gives the result for any triple of distinct indices. The 
result is 

>!'., 

(100) 



r := {\og[p q /p s },log[p r /p s }} 

= 2cot(^^) +2cot(^^) +2cot(- 



Note that ^q, r ,s is invariant under cyclic permutations of (q, r, s) and odd under 
transpositions. We adopt the convention that ^ q , r ,s vanishes if two indices coincide. 

We will now give formulae for the functions /; discussed at the beginning of this 
section. Let e ir > = det(C) and G(x) = 2 cot (art/2) di. Then 



n-l 

E 

k=l 



) = Y j e k [G{e k - Qi) + G(e t + v - e n ) + G(d n - v - e k )] (101) 
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solve the system l|92l) . Given any scalar function g, another solution would be 
fi + Jgj; however, we have found no choice of g that leads to a simpler formula. It is 
possible to compute the integral defining G, but this is not an elementary function — 
it involves dilogarithms. Verifying that do indeed solve Ij92(l is unenlightening; 

the key observations are that G is odd, G' is even, and that xG'(x) + 2G(x) — 
2cot(x/2). 

While it is not immediately obvious, Proposition l8.4l does permit us to determine 
Proposition 8.5. With * as in (llOOjl . 

n 

{logKUogW} =^2^q,r,k- (102) 

k=l 

Proof. Let us write M k i = {log[^fc], log[^;]}. Regarding \i\ as a function of the 
remaining masses, the Leibnitz rule gives 

n n 

Multiplying through by and summing over k = 2, . . . , n gives 

n n n n 

V Mfe^fe.M = V) VkM kl + (1 - m ) V f^M pl + y ^M kp . 

— — — Ml z — ' Ml 

fc=2 fe=2 p=2 ^ 1 fc,p=2 ri 

The last (double) sum vanishes because M kp is antisymmetric and so we can deduce 

n ^ n 

^Wfe*fe,i,i = — y^pMpi = -{log[ui],log[// z ]}. 

fc=2 ^ p=2 

The result now follows by relabelling indices. □ 
In an earlier paper, j21j . we determined the Jacobian of the map 

(6i,fjLi,...,0 n -i,fJt n -i,O n ) ' ^ (uo,Vo,...,Un-X,V n -2,<t>) (103) 

where = u k + iv k and a„_i = e Ic ^. The analogous result for Jacobi matrices is 
due to Dumitriu and Edclman, which served as our guide. 

We asked for a simpler, more direct derivation of the Jacobian. A recent preprint 
of Forrester and Rains, |15| . gives a very direct solution. Percy Deift showed us 
a derivation of the Jacobi matrix result using the symplectic structure naturally 
associated to the Toda lattice. We would like to close with the corresponding proof 
for CMV matrices. The key idea is the following: As we can write the underlying 
symplectic form in either set of variables, we can view l|l(J3|) as a symplectomorphism 
between two concrete symplectic manifolds. In particular, it must preserve the 
Liouville volume. 

Corollary 8.6. The Jacobian of the change of variables <|1U3|) is given by 



det 



d(uo,V ,...,<j>)] d-nPo ' ' -Pn-2 



9(6*1, ui, ...,6 n ) 



-2 1 



Ul • • • /!,, 
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Proof. The whole proof amounts to writing the symplectic volume in each set of 
coordinates. In terms of the Verblunsky coefficients, the answer can be read off 
from 



LU A ■ ■ ■ ALU = 



Yl-2 

n 

Lfc=0 



Pk 



oIuq A dva A ■ ■ ■ A du n ^2 A dv n . 



(104) 



To determine the volume in terms of 9k and fik we begin by using the Darboux 
coordinates 9k,(j)k constructed in this section: 

w A • • • A to = ddi A d(f>i A ■ ■ ■ A d6 n -i A d(j) n -i 

= det( J) d0\ A dfii A ■ ■ ■ A d8 n -i A cfyi„_i 

where J is the 2(n — 1) X 2(n — 1) block matrix 



J 



Id 



90, 



K 



with K u = a^llogt/ife/Mn] = 1 &ki + \^ n 1 ■ 



It is not difficult to evaluate det(-fT) by Gaussian elimination: first subtract the 
first column from all other columns, then for each 2 < k < n — 1, subtract /ik/l^n 
times the fcth column from the first. The result is an upper triangular matrix whose 
diagonal is [^- + J L-+J2l~l zfifc, 5^, •• 5^]. Recalling that f n+- ■ -+^ n = 1, 
the product simplifies to give 



u A •• • A u 



,fe=i 



d0\ A d(j,\ A ■ ■ ■ A d8 n ^i A d/i n _i 



In order to add the last coordinate, recall that (—1)™ l a n -i = det(C) = ]^[e* 6 ' !c . 
Therefore — d(f> = d9i + ■ ■ ■ + d0 n and so 



n-1 

n 



Lfc=0 
2 l-n 



dito A <iwo A • • • A du n ^2 A dv n -2 A 1 



n^ 1 



.fe=i 



d#i A dfi% A ■ ■ ■ A dQ n -i A d/i n _i A d0 r , 



which implies the claim by the change of variables formula of differential geometry. 

□ 

Remark 8.7. By placing the result at the end of such a long paper, we have 
perhaps given the impression that this derivation of the Jacobian must be the most 
complicated of all. However, all that we really used was the commutativity of 
the eigenvalues and (|71fl . If one remains single-minded, these can be derived very 
quickly from either definition of the bracket l|19|) or (|42|l . 
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